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HIGH-LEVEL EXPRESSION AND EFFICIENT RECOVERY OF 
UBIQUITIN FUSION PROTEINS FROM ESCHERICHIA COLI 

Field of Invention: 

The invention generally relates to 
the field of molecular biology and more specifically 
to fusion proteins involving Ubiquitin and their use 
to scale up fermentation processes and enhance 
yields of recombinantly produced proteins. 

Background: 

Ubiquitin has been used as a fusion 
partner for the synthesis of recombinant peptides 
and proteins with other host -vector systems. 
Ubiquitin fusions expressed at 20% of the total cell 
protein, using the yeast ubiquitin coding sequence 
under control of the lambda repressor in E. coli 
strain AR58, have been reported (2) . Peptide 
libraries have been synthesized as ubiquitin fusions 
at yields of 60 mg/liter of culture (20) . Yields as 
high as 1 mg of peptide product per liter of 
bacterial culture have been reported by Yoo and 
Rechsteiner (7) . However, there is no literature 
pertaining to the performance of ubiquitin fusion 
expression systems in fermentor cultures. 

Ubiquitin, a 76 amino acid protein, 
is an integral part of the pathway that mediates 
protein. degradation in eukaryotic cells. Ubiquitin 
is covalently attached to intracellular proteins 
that are targeted for rapid turnover. Branched 
chains of ubiquitin are then conjugated to the 
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first, allowing the target protein to be directed to 
and/or recognized by the proteosome, a multisubunit 
macromolecular structure that degrades the target 
protein to amino acids and peptides and recycles the 
ubiquitin monomers (1) . Ubiquitin itself is 
extremely resistant to proteolytic degradation and 
has a long half-life (180 minutes) when expressed in 
E. coli (2) . Several proteins have been expressed 
as C- terminal ubiquitin fusions, some of which 
cannot be produced at all in bacteria (3, 4) or 
yeast (5, 6) as independent entities. Even short 
peptides have been produced efficiently in E. coli 
as ubiquitin fusions (7) . Ubiquitin is thought to 
stabilize the nascent peptide chain as it comes off 
15 the ribosome during translation, and to act as a 
chaperonin to fold the recombinant protein more 
efficiently (2) . 

Several issues remain problematic 
with E. coli expression including the avoidance of 
proteolysis and the production of proteins in a 
soluble or native state (8) . In E. coli, 
recombinant proteins have been expressed at levels 
as high as 50% of the total cellular protein, 
particularly at low cell density (9) . While this 
25 should result in tens of grams per- liter in high 

cell density fermentations, production at this level 
has not been reported. This is partially due to a 
decrease in the metabolic efficiency of E. coli at 
high cell density, and to increased in vivo protein 
3 0 turnover. To ensure adequate expression, nutrients 
may be supplied during the induction phase of an 
extended fed-batch culture (10, 11) . However, 
addition of nutrients may result in lower yields 



20 
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because of nutrient interaction with various 
intracellular pathways and functions. For example, 
addition of excess glucose often results in 
increased organic acid production which decreases 
5 overall yield (e.g., acetate, (12)}. High 

concentrations of TCA-cycle intermediates and amino 
acids have resulted in reduced protein yield due to 
the altered regulation of the TCA cycle, which must 
be fully operative for maximum expression (13) . 

10 Many reports have indicated that 

sensitivity to proteolytic attack is also 
responsible for low expression levels at high cell 
density. Most often, differences in stability are 
attributed to the specific protein (state of 

15 folding) and not to the conditions by which the 

proteins were expressed (14) . . Indeed, many protein- 
specific characteristics have been elucidated which 
lead to enhanced degradation. Yet, it is also well 
known that the activity of the attacking proteases 

20 or peptidases depends on cellular stress (15, 16) . 
For example, Harcum and Bentley demonstrated that 
induced expression of chloramphenicol -acetyl - 
transferase (CAT) elicits several proteolytic 
activities that also appear during the stringent 

25 stress response (17, 18) . Others have noted that 
overexpression of "abnormal" proteins amplifies 
stress protein levels and protease La (19) . Since 
ubiquitin is an eukaryotic stress protein, it 
appears to be less sensitive to the elevated 

3 0 cellular stress that often appears in bacteria 
during overexpression conditions. 



SUMMARY OF THE INVENTION 
An object of this invention is to 
avoid or minimize proteolysis with E, coli and like 
expression systems and to enhance the production of 
proteins in a soluble or native state. A further 
object of the invention is the consistent high 
specific yield. A still further object is to obtain 
over one gram per liter yield of fusion protein when 
the fusion is soluble and the two-step purification 
applies . 

The combination of optimal codons for 
individual peptide fusions are thought to be in part 
responsible for the high specific yields which makes 
this technology so valuable. Focussing on 
maximizing biomass in the cultures as this invention 
does, rather than manipulating the physiology of the 
organism, increases the specific yield, which 
increases overall yield by decreasing the amount of 
downstream processing. 

The term fermentation, as used 
herein, is intended to refer to processes involving 
the production of recombinant protein products. 

In the system of this invention, 
ubiquitin fusions are employed. They are extremely 
stable and are expressed at high levels in soluble 
form. E. coli has been used as a model for 
ubiquitin-peptide fusion systems in 10 -liter batch 
cultures. Ubiquitin fusion proteins are produced 
at very high levels in E. coli and related hosts. 
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The specific yield is defined as the percentage, 
taken as a ratio, of recombinant protein product to 
total cellular protein, as measured by densitometry 
of SDS-PAGE gels run on whole cell samples lysed in 
5 loading buffer and loaded directly. Specific yields 
in the E. coli ubiquitin ..fusion system, grown and 
induced as described, exceed 50% and approach 90%. 
The highest reported accumulation of a recombinant 
protein in E. coli is 50% of the total cellular 

10 protein, i.e., 50% specific yield (9). Another E. 
coli expression system claims 40% specific yields 
for some recombinant proteins (29) . Under the 
conditions specified in the invention, this level of 
expression exceeds 50% specific yields which results 

15 in yields of recovered fusion protein of over one 
gram of fusion protein per liter of bacterial 
culture. This protein is both soluble and 
recoverable. The majority of protein in 
supernatants of lysed cells is product. The fusion 

20 can be further purified from host proteins with an 
85°C heat step, in which most of the host proteins 
precipitate while the ubiquitin fusion stays in 
solution. 

The ubiquitin fusion process is 
25 outlined in Figure 1. Briefly, the process involves 
cloning the peptide or protein of choice as a C- 
terminal ubiquitin fusion into a suitable expression 
vector. Suitable peptides or proteins include, but 
are not limited to, a size range between 2 amino 
acids and 6000 amino acids, more typically between 6 
amino acids and 500 amino acids, and specifically 
between 8 amino acids and 70 amino acids as shown by 
sequences specified in Table 4. 



30 
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T7, trc, lac, and ApPL are preferred. 
Additional suitable promoters include lacUVS, tac, 
trc, trp, lexA, malE, araC, and nifA. 

According to the invention, clones 
5 are screened for good expression and the correct DNA 
sequence using techniques which are well known to 
those of ordinary skill in the art. Fusion proteins 
are then expressed and isolated and the product is 
enzymatically released from ubiquitin by an 

10 ubiquitin-specif ic protease. The specificity of 

these proteases is exquisite so that product is not 
degraded. These proteases include, for example UCH- 
Ll, UCH-L2, UCH-L3, YUH1, UBP1, UBP2, UBP3 , UBP4 , 
UBP5, UBP6 and fat facets (Drosophila) . See also 

15 U.S. Patent No. 5,212,058 and International 
Application No. WO 91/17245 in this regard. 
Furthermore, any N- terminal amino acid on the 
product is possible because the cleavage site always 
follows the glycine- 76 residue of ubiquitin. Once 

20 released, the product is purified and lyophilized. 

The invention also includes a means 
of amplifying expression levels by coupling a 
chemical induction using, for example, 
isopropylthiogalactoside (IPTG) with heat shock. A 

25 12°C heat shock, upon induction of the ubiquitin 
fusion with IPTG, produces an excellent yield of 
recoverable protein. Heat precipitation of host 
proteins is dependent upon the conditions under 
which the culture is grown and induced. A heat 

3 0 shock coupled with induction results in the 

stabilization of the host proteins, such that the 
benefits of the 85°C heat purification step are 
lost. However, under these conditions, the 
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ff ubiquitin fusion is overexpressed to greater than 

90% of the total soluble protein, making the 85°C 
purification incubation unnecessary. The 
extraordinary stability of ubiquitin to heat and to 
5 proteases/ coupled with the high levels of 

expression attainable in.E. coli, make it an ideal 
fusion partner for a bioprocess. 

In addition, the invention includes 
the development of a transformant strain of E. coli 

10 that overexpresses a recombinant protein in excess 

of 50% of the total protein, preferably at least 1.0 
gram per liter. (See strain A37 as an example) . 
Transformation of the host is effected by 
expression plasmids, such as pDSUb, pPXl, pRSET-Ub, 

15 pPL-Ub. Other suitable host cells can be employed, 
such as bacteria other than E. coli, yeast, insect, 
plant or mammalian cells. 

Further, the invention includes 
processes for growth of bacterial and other host 

2 0 cells and induction of synthesis of a recombinant 

protein that yields the desired product in excess of 
50% of the total protein or one gram per liter of 
bacterial culture. The induction includes chemical 
induction of synthesis of a recombinant protein, and 

25 if desired, simultaneous physiological stress to 
enhance product yield in excess 5 0% of the total 
protein or of one gram per liter of the culture. 
IPTG, lactose and other chemicals known in the art 
may be used as the agent for chemical induction. 

30 These include, inter alia, tryptophan, sugars and 
analogs, maltose, arabinose, nalidexic acid, 
oxalinic acid and nitrogen. The physiological 
stressing agent includes heat shock as well as other 



8 

conventional techniques, which may be simulated by 
the presence of nalidixic acid or other related 
agents known to induce heat shock genes. 

The process of the invention results 
in a cell lysate where the bulk of the cell lysate 
is the desired recombinant product, i.e. 10% or less 
of the recovered protein consists- of host protein, 
and 90% of the protein consists of recombinant 
protein. The process generally includes a minimum 
of 2 steps to produce a recombinant protein product 
from crude cell lysates. 

The protein product of this invention 
is intracellular and not secreted protein 
(periplasm) as in some other conventional systems. 
Another unique feature of the invention is that for 
the bulk of the clones, the protein remains soluble 
at this level of expression, a major benefit. 

The invention demonstrates a new 
process for growth and induction of bacterial 
cultures expressing recombinant proteins from the 
laboratory bench to the pilot scale for production. 
In previous systems, increased protein turnover and 
a decrease in the metabolic efficiency of £. coli at 
the high cell densities achieved in the fermentor 
resulted in failures in scale-up. In these systems, 
there was a lack of correlation between increased 
culture volume, increased cell density and increased 
specific yield of product. These limitations do not 
occur in the present invention. The data indicate 
that: 1) yields in shaker flask cultures are 
directly scalable to the 10 liter fermentor, 2) 
higher cell densities actually augment the specific 
yield of ubiquitin fusion proteins, and 3) the 
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ubiquitin fusion expression clone' makes about 3 fold 
more total protein, including host proteins, than 
the parent strain of E. coli. A series of 3 
fermentations with varied temperature shift 
5 protocols showed that a maximal heat shock of 12 °C 
(from 30°C to 42°C) , done simultaneously with 
induction, gave a maximal specific yield (over 90% 
of the total soluble protein by densitometry, 709 
mg/liter by protein assay) , in which the 
10 recoverable ubiquitin fusion product comprised 16% 
of the wet weight of the cell paste. These results 
demonstrate the great value of ubiquitin fusion 
technology for the economical large scale production 
of peptides. 

15 

BRIEF DESCRIPTION OF THE DRAWINGS 

Pig. 1: Flow diagram of the 
ubiquitin fusion process for making recombinant 
peptides in bacteria. 



Fig. 2: Time course and analysis of 
fermentations D68 and D69. Comparison of growth 
curves (circles) versus specific yield (diamonds) as 
measured by densitometric analysis of whole cells 
samples in SDS-PAGE gels as described in materials 
and methods. Open markers are D68 datapoints and 
filled markers are DBS datapoints. 

Fig. 3: Time course analysis of 
protein expression for fermentation D72. Culture 
samples were taken at one hour time points prior to 
induction (samples 1-4) and half hour time points 
following induction (samples 5-9.) Samples were 
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processed and SDS-PAGE gels were run as described in 
experimental methods. Each time point is 
represented by three lanes; "W" for .whole cells, »L" 
for lysate, and »H« for heat supernatant. Ubiquitin 
standards are shown in lanes marked »U" and protein 
standards are shown in lanes marked "S" . Lane 15 
contains an "L" sample from the last timepoint prior 
to harvest for D81. 



Fig. 4: Growth curves for 
fermentations D72, D73, D74, D80 and D81. Optical 
density at 600 nm is plotted as a function of time. 
Cultures were induced at time zero with 1 mM IPTG. 
The solid line represents a regression through D72, 
D80 and D81 datapoints. 

Fig. 5: Comparison of time courses 
of specific yield estimates for fermentations D72, 
D73, and D74 derived by densitometry on whole cell 
samples. »'W" (panel A) versus clarified lysate 
samples "L" (panel B) . 



Fig. 6: Comparison of specific yield 
estimates for fermentations D72, D73, and D74. 
Derived by densitometry on whole cell samples "W", 
clarified lysate samples »L», and heat supernatants 
"H". Open spaces at the tops of bars represent +1 
standard deviation from the mean. 



Fig. 7: Comparison of soluble 
protein recovered from time point samples in 
fermentations D72, D73, D74, D80 and D81. Protein 
concentration was quantitated in each "L" sample a 
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described and plotted as a function of the cell 
density at the time each sample was taken. The 
protein/O. D, values shown in the inset represent the 
average and standard deviation of the final five 
datapoints for each fermentation. 



Figure 8: Growth curves for 10 liter 
fermentations in which ubiquitin peptide fusions 
were expressed. Fermentation runs of different 
fusion proteins are designated as shown in Table 4. 

10 Figure 9: SDS-PAGE analysis of the 

bacterial lysates. Lane 1: ubiquitin standard, 4 fig 
(Sigma), lanes 2, 4, 6, 8, and 10 contain 3 fil of a 
1:10 dilution of each lysate in order as follows: 
D95, D85, D86, D106, D96 , lanes 3, 5, 7, 9, and 11 

15 contain UBP digests of the preceding lysate. Lane 
12 contains a mock digest of D96 without the UBP. 
Lane 13 contains a molecular weight standard 
(Novex) 



20 



Figure 10: HPLC analysis of the D85 
bacterial lysate, cleavage reaction and the peptide 
product. Reverse phase HPLC was done with a 
gradient of 25-4 0% buffer A to B run over 18 
minutes . Panel A shows an ubiquitin standard 
(Sigma) elution profile. In panel B, a 1:10 
25 dilution of the D85 bacterial lysate was mixed with 
an ubiquitin standard, showing the relative purity 
of the ubiquitin fusion protein, and its location in 
the gradient elution relative to ubiquitin. Panel C 
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shows the cleavage reaction of the D85 lysate, with 
ubiquitin and a clean peptide (see arrow) as 
products of a complete reaction. Panel D shows the 
purified peptide product analyzed in 15-30% gradient 
5 run over 3 0 minutes. 

Figure 11: FAB Mass Spec Analysis of 
the peptide product. Panel A shows the "chemical 
average" of the product while panel B shows a high 
resolution scan. The product corresponds exactly to 

10 the predicted molecular weight of the peptide. FAB 
mass spec has been used to verify that the molecular 
weights of the peptides released from the D86 and 
D95 fusions are correct. The predicted molecular 
weight of the purified D85 peptide complexed with 

15 TFA is 2089.15 Daltons. The peptide peak is shown 
with an arrow. 

Figure 12: A. Plasmid PTre99A. B. 
Nucleotide sequence encoding human ubiquitin (SEQ ID 
NO:l) . 

2 0 DETAILED DESCRIPTION OF THE INVENTION 

Materials and methods 

Bacterial Strains and Plasmids. The host 
strain, E , coli, DHSaF'IQ, was purchased from Life 
Technologies, Inc., and transformed with the 
25 ubiquitin expression vector, pPX2 . pPX2 is a 

derivative of pDSUb that contains a 17 amino acid C- 
terminal extension of ubiquitin. pDSUb is a 
derivative of pDS78/RBSII that expresses ubiquitin 
(25) . pPX2 was constructed by inserting a PCR 

3 0 product encoding the peptide and fusion junction in 
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a directional cloning into pDSUb that had been 
digested with A flu and PstI, purchased from New 
England Biolabs and used according to their 
instructions. Transformants were selected with 100 
fig /ml ampicillin, grown for two days at 30°C. The 
ubiquitin fusion is overexpressed by the addition of 
IPTG which titrates the lac repressor (DHSaF'IQ 
contains an upregulated lad* gene on the F' factor 
which is selected with 10 fig /ml neomycin.) 
Transformants are screened in 10 ml cultures for 
expression of a fusion protein of the appropriate 
size. Plasmid DNA is subsequently prepared from 
clones that exhibit an overexpressed, induced 
ubiquitin fusion protein, and sequenced by the 
15 dideoxy method using the Sequenase Version 2.0 kit 
(United States Biochemical . ) Positive clones are 
then frozen down and stored. Clone A3 7, used in the 
Examples, was maintained on LB plates containing 
ampicillin (100 /xg/ml) , neomycin (10 fig/ml) and 1% 
20 glucose, at 30°C. It is streaked weekly for up to 
10 passages, after which a fresh streak is taken 
from a frozen seed vial for serial culture, to 
insure strain authenticity. 

A New transformed strain of E. coli 

25 This invention includes transformants 

having unexpectedly better characteristics than 
parent strains : increased protein content over parent 
strain, increased expression levels and atypical 
growth curves. 

30 These unique characteristics are evidenced 

by the following findings: 
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1. Transformation frequency in cloning 
experiments is 10 5 -10 6 times lower than expected for 
the amount of DNA used and the level of competence 
of the cells. This discrepancy of 10 6 is the same as 

5 the frequency with which random single mutations 
occur in E. coli. 

2. Since ubiquitin is toxic to the 
bacteria, any strain that overproduces fusions is 
somehow tolerized to the intracellular presence of 

10 ubiquitin. In fact, the ubiquitin-EGF2 expressing 

plasmid in A 37 (pPX2) is very stable, even when the 
ampicillin selection is removed. Attempts to "cure" 
four of the overexpressing strains of their 
ubiquitin fusion expression plasmids were made. The 

15 greatest overexpresser, the Ub-BMP clone, could not 
be cured. 

3. After curing the EGF2 expression 
strain we transformed in full length EGF and 
uteroglobin expression plasmids. These were 

20 expressed better in the cured strain than the 
original clones. 

4. In contrast to the parent strain, the 
expression strain does not grow when a colony on 
rich media is used to inoculate minimal liquid media 

25 with glucose as the only carbon source. 

5. The fermentation profiles of the EGF2 
strain for expression level and growth are atypical. 
The parent strain does not exhibit the same 
"behavior 11 . 

30 6- The EGF2 -producing strain exhibits a 

greater protein content than does the parent strain. 
This is shown in Fig. 7, which displays soluble 
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protein as a function of time during the 
fermentations. For the parent strain the protein 
content is a flat line, while the protein content 
increases dramatically for the ubiquitin fusion EGF2 
5 strain. The gel data indicate that this difference 
is not due simply to the Induced expression and* 
production of the ubiquitin fusion, but that host 
proteins are elevated as well. 

Growth conditions. Experiments with 

10 A3 7 were performed in 14 L New Brunswick Microferm 

bioreactors (10 L w.v.) at the UMD Bioprocess Scale- 
up Facility. 2X YT medium without glucose, was 
utilized in all fermentations and was prepared as 
described by Sambrook (26) . In order to provide 

15 homogeneous plasmid-bearing cell populations, 

ampicillin (100 /zg/ml) was added to the media. 500 
ml starter cultures in 1 L shake flasks were grown 
from single colonies by overnight incubation (16-2 0 
hours) in LB media (prepared according to Rodriguez 

20 and Tait (27)) with ampicillin (100 /zg/ml) , neomycin 
(10 tig/ml) and 1% glucose. Shake flask cultures 
were maintained at 30 °C and 200 rpm in a New 
Brunswick reciprocating environmental shaker, 
Bioreactors were monitored and controlled to pH 7.0. 

25 Dissolved oxygen was maintained above 30% of air 

saturation. The inoculum used for all experiments 
was approximately 5% v/v. Cells were induced by 
IPTG addition to 1 mM at OD 600 = 2.0. In some 
experiments, the temperature was raised to 4 2 °C 

3 0 simultaneously with induction. 

Analytical Measurements. Cell Mass 
Density Determination: A Bausch & Lomb Spec 21 DV 
spectrophotometer was used to measure the optical 



density (O.DJ of the samples at 600 nm. Samples 
above the linear range (0-0.25 OD fi00 units) were 
diluted with deionized water. 

Protein gels. One milliliter samples 
of growing cultures were taken at half hour 
intervals throughout each fermentation. The cells 
were pelleted with a 5 min. centrifugation in a 
Sorvall MC 12V microfuge at a setting of 6. Cell 
pellets were then frozen on dry ice and stored at - 
80°C until processing. To generate protein samples 
for SDS-PAGE electrophoresis, the cell pellets were 
rapidly thawed, resuspended in 3 00 ml of lysis 
buffer (50 mM Tris, 1 mM EDTA, pH 7.5), and a 100 ml 
aliquot of the cell suspension was removed to a new 
tube'. The 100 ml aliquots were prepared directly 
for electrophoresis with the addition of 200 ml of 
2X loading buffer (100 mM Tris-HCl pH 6.8, 200 mM 
DTT, 4% SDS, 0.2% bromophenol blue, 20% glycerol), 
labelled "whole cells", and heated to 95°C for 15 
minutes. The remaining 200 ml of cell suspension 
was subjected to three cycles of freeze- thaw, 
alternating between a dry ice-isopropanol bath and a 
50 °C water bath. Samples were left in the water 
bath only up to the point at which" they, became 
completely thawed, then were immediately frozen 
again. Following the last freeze- thaw cycle, the 
samples were placed in a Branson 3200 sonic cleaner 
bath at maximum setting for 10 min. After 
sonication, the samples were split into two 100 ml 
aliquots. One set of these aliquots was placed in 
the microfuge at maximum setting for 10 min at 4°C. 
The supernatants of this set were transferred to 
fresh tubes and the pellets discarded. These 
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samples were labelled "lysate" and were prepared for 
electrophoresis with the addition of 200 ml of 
loading buffer. Five 1 ml aliquots from each sample 
were removed for protein determinations prior to 
addition of loading buffer. The last set of 
aliquots were incubated at 85°C for 10 min prior to 
centrifugation in the microfuge to remove 
precipitated host proteins and cellular debris. 
These were labelled "heat supernatant" and were 
prepared for electrophoresis with the addition of 
200 ml of loading buffer. In this manner, 
comparable sets of samples from each time point, 
based on equivalent volumes, were generated. 

SDS-PAGE gels were run using a two 
buffer tricine system in a Novex minigel apparatus, 
with 10-20% tricine gels purchased from Novex. 
Anode (bottom) buffer was 0.2 M Tris-HCl, pH 9.0. 
Cathode (top) buffer was 0.1 M Tris, 0.1 M Tricine, 
0.1% SDS, pH 8.25. The protein molecular weight 
marker, "Multi-Mark" , was obtained from Novex. 
Bovine ubiquitin, used as a standard, was purchased 
from Sigma. Gels were run at a constant current of 
4 mA until the dye marker reached the bottom of the 
gel. Whole cell samples were loaded in 5 ml 
volumes, lysates and heat supernatants were loaded 
in 10 ml volumes. Gels were stained with 0.25% 
.Coomassie Blue R250 (Sigma) and acetic acid: methanol 
(10%: 40%) and destained in the same solution minus 
the dye . 

Densitometry. Stained gels were 
scanned with a black and white Hewlett Packard 
Deskscanner 2P with Deskscan 2.0 software on a 
Macintosh IIvx computer. Images were analyzed with 
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shareware "NIH Image" version 1.55. Specific yield 
data (expressed in percent) . represents the area 
under the ubiquitin fusion peak compared to the 
total area under all peaks in the same lane. 
5 Protein assays. Protein 

concentrations were done. using the Pierce BCA assay 
kit, using a 30 min incubation at 37°C, according to 
the manufacturer's instructions. Protein 
determinations in all data reflect the 

10 concentrations found for designated samples in a 
fixed sample volume of 5 ml from each of the 
samples, prior to the addition of SDS-PAGE buffer as 
described above. Purified bovine ubiquitin (Sigma) 
was used to generate standard curves in all protein 

15 determinations . 

Analytical RP HPLC was done on a 4.6 X 250 
mm Vydac C18 column with 5 fim particle size. 
Gradients of buffers A to B (buffer A: 0.1% TFA, 
buffer B:0.1% TFA in acetonitrile) were run either 
20 25-40% over 18 minutes or 15-30% over 30 minutes 
were done. 



Enzymatic reaction. Five hundred mis 
of the D85 lysate, containing approximately 14 grams 
of ubiquitin fusion protein, was brought up to 800 
mis in UBP reaction buffer (50 mM Tris, 10 mM DTT, 1 
mM EDTA, pH 8.5). Approximately 2 mg of UBP enzyme 
was added and the reaction incubated for 8 hours at 
room temperature. The reaction was stored at 4°c 
until preparative RP-HPLC could be done. 
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Recovery of purified peptide. 

Preparative RP HPLC was done on a Biotage "kiloprep" 
instrument. 250 mis of the cleavage reaction was 
loaded and a 15-40% A:B gradient was run over 45 
minutes on a 15 x 7.5 cm column packed with vydac 
C18 resin (particle size of 15 microns.) Peptide 
was recovered in 64 0 mis of solvent and analyzed by 
analytical I I PLC. 



Quantitation of purified peptide. 

Several one ml aliquots of purified peptide 
collected from HPLC were lyphilized and the TFA 
salts were weighed on a Mettler balance (Model #) . 
The average weight of each aliquot was 1.7 mg 
corresponding to a yield of 1.088 grams from 
approximately one third of the cleavage reaction (< 
the digestion of about 4.38 grams of the ubiquitin 
fusion) . Thus, the purified peptide was recovered 
in nearly quantitative yield. 



FAB Mass spec analysis of the 
lyophilized peptide. The molecular weight of the 
lyophilized peptide was verified using FAB mass spec 
analysis by Dr. Mark Rogers of M-Sean, Inc. 

Please note that the starting materials 
are publicly available and, using the taught cloning 
criterion and procedures, overexpressing 
transformants, like A37, are can be duplicated in a 
reproducible manner. 
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EXAMPLE 1 

A. Construction of an ubiquitin expression 

vector for expression of E. coli . 
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Plasmid PTre99A was purchased from 
Pharmacia (Pig. 12A) and digested with restriction 
enzymes, Ncol and.Pstl, and the linearized vector, a 
4.2 kb fragment, was purified from an agarose gel, 
using Geneclean (BiolOl, Inc.). The nucleotide 
sequence encoding human ubiquitin shown in Fig. 12B, 
with codon usage optimized for expression in 
bacteria per Andersson and Kurland (28) , was 
amplified by PCR with oligonucleotides specific for 
the 5' and 3' end, and containing restriction sites, 
Ncol and Pstl. The PCR fragment was digested and 
gel purified as described for the vector. The 
vector and ubiquitin fragments were then ligated and 
the ligation mix was transformed into DHSaFIQ 
competent cells obtained from BRL. Several colonies 
were screened for the insert by restriction digests 
of plasmid preparations. Plasmids with the correct 
restriction map were verified by DNA sequencing of 
the ubiquitin encoding DNA. Correct clones were 
then screen for this IPTG- inducible expression of 
ubiquitin. One of the ubiquitin-expressing clones 
was selected as the master clone and frozen down. 
The new ubiquitin expression vector was called pPXI. 

This parental expression vector is used 
for construction of new ubiquitin fusions for 
expression in E. coli. The target peptide is cloned 
as a fusion with ubiquitin into pPXI by synthesizing 
DNA oligonucleotides, encoding the target peptide, 
using bacterial codon usage according to reference 
28. The other essential features of the 
oligonucleotide sets are that they contain sequences 
necessary for the fusion with ubiquitin at the 5' 
end, a stop condon (TAA) at 3' end, and restriction 
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10 



sites' for cloning. A minimum of 2 oligonucleotides 
is required for short peptide extensions of 
ubiquitin. Longer peptides require the synthesis of 
4-8 oligonucleotides, up to 75 bases in length, and 
staggered such that they anneal together at the ends 
to form a single continuous coding sequence for the 
target peptide when ligated. Still larger fusions, 
of proteins (greater than 100 amino acids) with 
ubiquitin must be constructed stepwise using 
synthetic oligonucleotides or amplified by RT-PCR 
from a natural source of the coding sequence. These 
protein coding sequences are typically amplified 
with a set of mRNA-specif ic primers that flank the 
coding sequence. Following gel purification of the 
15 PCR product, a second amplification is done with 

primers hearing the essential features for cloning 
of an ubiquitin fusion as described above. 

B, Construction of an ubiquitin expression 

vector for expression in yeast. 

20 Plasmid pPIC9 was purchased from 

InVitrogen and digested with restriction enzymes. 
Xhol and EcoRI, and the linearized vector, a kb 
fragment was purified from an agarose gel, using 
Geneclean (BiolOl, Inc.). The nucleotide sequence 

25 encoding human ubiquitin, with codon usage optimized 
for expression in bacteria per Andersson and Kurland 
(28), was amplified by PCR with oligonucleotides 
specific for the 5' and 3' end, and containing 
restriction sites, Xhol and EcoRI. The PCR fragment 

30 was digested and gel purified as described for the 
vector. The vector and ubiquitin fragments were 
then ligated and the ligation mix was transformed 
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into DH5aP'IQ competent, cells obtained from BRL. 
Several colonies were screened for the insert by 
restriction digests of plasmid preparations. 
Plasmids with the correct restriction map were 
verified by DNA sequencing of the ubiquitin encoding 
DNA. Correct clones were then transformed into the 
yeast Pichia pastoris, and screened for the 
methanol -inducible expression of ubiquitin. One of 
the ubiquitin-expressing plasmids was selected as 
the master clone and frozen down. The new ubiquitin 
expression vector was called pPX200 . This parental 
expression vector is then used for construction of 
new ubiquitin fusions for expression in yeast. 

C. Construction of an ubiquitin expression 

15 vector for expression in insect cells. 

Plasmid pVLl3 93 was purchased from 
InVitrogen and digested with restriction enzymes, 
EcoRI and PstI, and the linearized vector, a 9.S kb 
fragment was purified from an agarose gel, using 
Geneclean (BiolOl, Inc.). The nucleotide sequence 
encoding human ubiquitin, with codon usage optimized 
for expression in bacteria per Andersson and Kurland 
(28) , was amplified by PCR with oligonucleotides 
specific for the 5' and 3' end, and containing 
25 restriction sites, EcoRI and Pet I. 

EXAMPLE 2 

Time course of ubiquitin fusion 
expression during fermentations. Initial 
fermentation experiments were designed to 
30 approximate shaker flask protocols as closely as 
possible in a 10 liter fermenter. This entailed 
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inoculation with a fresh 50 0 mL overnight culture 
(5% of the expression culture volume), growth at 
37°C throughout the fermentation, and induction of 
ubiquitin fusion expression at o.D. S00 =0.5 [optical 
5 density of the bacterial culture read at 600 nm=0.5 
(or about 5xl0 7 cells/ml)] with 1 mM IPTG, One 
milliliter culture samples were taken at hourly 
intervals after induction and specific yield was 
measured at these time points by densitometric scans 

10 of SDS-PAGE gels loaded with whole cells. Two 

fermentations were run, one with a glycerol feed and 
one without. Fig. 2 shows the profiles of these 
fermentations. Both cultures reached a maximum 
specific yield over 50% of the total cell protein, 

15 between 2 and 3 hours post induction, after which 
cell lysis occurred and specific yield dropped 
significantly. There was no significant benefit 
from the glycerol feed, although it was not 
optimized. All future fermentations with this clone 

2 0 were harvested after two hours of induced 
expression, prior to lysis. 

EXAMPLE 3 

Effects of temperature shifts on 
expression of ubiquitin fusions. During the course 

2 5 of optimizing expression of ubiquitin fusions in 
shaker flask cultures, it was determined that 
temperature shifts, either from the inoculum to the 
expression culture, or during the expression, upon 
induction, improved overall yields. Much higher 

30 yields were obtained when the expression is done at 
37°C versus 30°C, even when 1 mM IPTG was used in 
the 30°C culture and 5 mM IPTG was used in the 37°C 
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culture. Five fermentations were performed to test 
the effects of temperature shifts on ubiquitin 
fusion expression, as summarized in Table 1. 

All overnight inoculum cultures were grown 
5 in shaker flasks at 30°C. Three fermentation 

protocols were used. D72 represents the typical 
expression conditions, in shaker flasks, which were 
done at a uniform 37°C throughout. D73 was heat 
shocked by 5°C (from 37°C to 42°C) upon induction 
10 with 1 mM IPTG. D74 was heat shocked by 12 °C (from 
30 °C to 42 °C) upon induction with 1 mM IPTG. An 
uninduced control fermentation of A3 7 (D80) was also 
run to examine the effect of the presence of the 
vector on growth, without IPTG induction. A control 
fermentation, D81, was done with the parent strain 
DH5aF'lQ, in which a 12°C heat shock upon addition 
of 1 mM IPTG was done to simulate the most extreme 
conditions to which the test strain was subjected. 



15 



Table 1 

20 Fermentation # Strain Initial Final O.D. 600 

Temperature Temperature at ' 6C ° 

Induc- 
tion 

25 



30 



D72 


A3 7 


37°C 


37°C 


2.0 


D73 


A3 7 


37°C 


42°C 


2.0 


D74 


A3 7 


30°C 


42°C 


1.9 


D80 


A37 


30°C 


37°C 


Not 


D81 








Induced' 


DH5aF'IQ 


30°C 


42°C 


2.0 



In Fig. 4, the optical density is depicted 
over time for all fermentations. From the slope, it 
is apparent that the growth rate prior to induction 
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was uniform. In the uninduced fermentation, D80, 
and in the induced fermentation of the parent strain 
with no plasmid, D81, the final o.D. was 
approximately 5.4 units. However, the growth rate 
was reduced during the induction phase of the 
induced cultures overexpressing ubiquitin. 
Moreover, as the degree of induction or stress was 
increased, the growth rate decreased. The most 
severely affected fermentation was grown at 30 °C, 
induced with IPTG and simultaneously heat shocked to 
42°C. 

Fig. 3 shows a time course of protein 
samples taken from D72 . For each time point; the 
gel lanes represent; 1) whole cells («W M ) , 2) the 
supernatant of cell lysates ("L"), and 3) the 
supernatant of an 85°C incubation of the cell lysate 
<"H"). Therefore, the «L» and »H" lanes represent 
recoverable, soluble and heat stable protein. 
Samples for protein analysis were taken at 1 hour 
intervals prior to induction and at half hour inter- 
vals following induction. Each sample for each time 
point in each fermentation was processed 
identically. Protein concentration determinations 
were done on recovered protein samples and heat 
supernatant samples, and these data appeared 
consistent with the degree of staining in these 
samples on the gels. Five microliter volumes of 
whole cell extracts were loaded, while 10 microliter 
volumes for the "L n and "H" lanes were loaded. 
Specific yields, calculated as the percent ubiquitin 
fusion protein in total cellular protein, for each 
set of samples, were determined by densitometry on 
whole cell "W" lanes. Lane 15 contains a sample of 
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the D81 fermentation taken at the end" of the 
fermentation prior to harvest, when the cell density 
reached a maximum. Using this, a comparison of the 
host proteins in the parent strain can be made with 
5 the A3 7 expression strain. The lack of protein 

bands in the parent strain compared to the visible 
non-ubiquitin protein bands in A37, in identically 
processed samples, demonstrates a reproducible 
difference between the strains. Protein assays on 

10 these samples indicate a 2 fold difference in the 
total protein concentration, measured with the BCA 
assay. Comparable bands for host proteins, such as 
GroEl, can be visualized in DH5aF' IQ when an 100- 
fold less dilute sample is loaded per lane than A3 7 

15 samples . 

In Fig. 5, the specific yield based on 
whole cell protein and specific yield based on 
soluble protein are plotted. The specific yield 
based on whole cell protein was relatively 
20 insensitive to the induction conditions. However, 
the yield of soluble proteins increased when heat 
shock was applied. The lowest yield (-65%) was 
found in the fermentation induced by IPTG addition 
at 37°C, D73. The culture grown and induced at 37°C 
25 but also heat shocked to 42°C, D73 , resulted in 

slightly higher yield (-72%) . In the fermentation 
grown at 30°C and simultaneously induced and heat 
shocked, D74, the soluble fraction of ubiquitin was 
effectively 100%. There was no significant residual 
protein in any PAGE gels of samples from this 
experiment . 

Table 2 illustrates the relative 
advantages of the heat shock protocols . There is a 
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clear benefit to heat shock, simultaneous with 
induction of the ubiquitin fusion protein. Although 
the overall yield of biomass is lowest for the 
maximally shocked culture, D74, it yielded the 
highest amount of ubiquitin fusion protein, at 70 9 
mg/liter, constituting 16% of the wet weight of the 
cell paste. 
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10 



15 



20 





TAB 


LE 


2 








Fermentation 


A 


B 


C 


D 


E 


F 


D72 


12.6 


0 .76 


52 


.390 


55 


7.1 


D73 


11.1 


0.66 


74 


.491 


39 


12.6 


D74 


12.4 


0 .75 


95 


.709 


44 


16.1 


D80 


6.9 


0.42 


82 


.340 


71 


4.8 



A - 



B 



E - 



F - 



Micrograms of protein measured by BCA assay in 
5 microliter of each "L" sample for the last 
sample in each run. 

Conversion of column A to micrograms of protein 
per microliter of bacterial culture, taking 
.dilution factors into account. 

Percent ubiquitin fusion, versus total protein, 
estimated by densitometry in the corresponding' 
"L" sample. 3 

Specific yields of ubiquitin fusion in grams 
per liter of bacterial culture (B x -C x 1000 
mg/g) . 

Wet weight of the cell paste harvested from the 
10 liter cultures in grams. 

Percent of wet weight of cell paste comprised 
of the ubiquitin fusion. 



EXAMPLE 4 

Effect of in vivo heat shock versus 
in vitro heat shock on protein recovery. Pig. 6 
shows the effect of heat shock on the specific yield 
of the ubiquitin fusion protein, relative to total 
protein, in whole cells, lysed cell supernatants 
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(soluble protein) and heat supernatant s (heat stable 
protein) . As previously noted, the specific yields 
in whole cells and heat supernatants were roughly 
equivalent for all three growth and induction 
protocols. However, the -specific yields of 
ubiquitin fusion protein in the soluble protein 
extracts increased with the severity of the heat 
shock upon IPTG induction. D74 exhibited 
essentially 100% ubiquitin fusion protein in the 
soluble extracts, about 3 0% more than the other two 
fermentations. Curiously, the specific yield of the 
ubiquitin fusion decreased by about 20% in the D74 
sample following the in vitro heat shock. The 
practical implication of this observation is that 
the use of an in vivo heat shock is more effective 
in generating higher overall yields of soluble 
ubiquitin fusion protein than the in vitro 
purification step involving an 85°C incubation 
followed by centrifugation. Thus, the use of an in 
vitro heat precipitation step is unnecessary when an 
in vivo heat shock is done. 

EXAMPLE 5 

Comparison of parent and ubiquitin fusion 
E coll strains, A significant difference in protein 
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metabolism between the strain expressing the 
ubiquitin fusion, A3 7, and its parent strain, 
DHSaF'IQ, is apparent from our data, as shown in 
Fig. 7. A comparison of protein content from 
identically processed samples for A3 7 and its parent 
strain show that as the cell density of the culture 
increases in A3 7, the protein per unit volume also 
increases, while the protein per unit culture volume 
decreases as the cell density increases for the 
parent strain. Therefore, the protein content per 
cell. decreases with increasing cell density for the 
parent strain, reflecting the loss in metabolic 
efficiency typically seen in high density cultures 
(9) . However, uninduced A3 7, in fermentation D80, 
15 shows a similar increase in protein with cell 
density. Induced A37, in fermentations D72-74, 
shows an increase in protein greater than the 
increase in cell density. The uninduced A3 7 (D80) 
contains three times (0.5 mg/ml vs. 0.17 mg/ml) as 
20 much total soluble protein. as the parent strain 
(D81) at harvest, and the induced A37 strain (in 
D72, D73, and D74) yielded between 4.3-5.0 times as 
much soluble protein as the parent strain. The 
total amount of soluble protein is dependent on the 
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presence of the ubiquitin fusion protein as well as 
on the induction of the ubiquitin fusion protein. 

EXAMPLE 6 . 

Fermentation scale up. Model ubiquitin peptide 
5 fusion clone (A3 7) was grown in a 10 liter fermentor 
to high cell density at 37 °C prior to IPTG induction 
of the ubiquitin fusion protein, in a rich 
"superbroth" (26) . The conditions used were 
identical to those for the initial fermentation 

10 experiments with this strain. Fig. 8 shows a 

typical growth curve for this strain. The cells 
were harvested by centrif ugation 2.5 hours after 
induction with 1 mM IPTG. This generated 175 grams 
of cell paste, which was processed as a single batch 

15 for recovery of the ubiquitin fusion protein. 

Recovery of the ubiquitin fusion protein. 
Soluble protein was recovered from the cell paste 
after: 1} resuspending in 600 ml of lysis buffer 
(50 mM Tris,.pH 10), 2) 3 freeze-thaw cycles, 

20 alternating between a dry ice ethanol bath and a 
50 °C water bath, 3) followed by sonication in a 
Branson Model 3200 sonic cleaner bath at maximum 
setting for 15 min, and 4) centrif ugation at 10K for 
20' in a Sorvall RC-5B GSA rotor. The supernatant 



("first extract" , approx. 700 ml) was transferred to 
a 1 liter plastic bottle for storage at -80°C. The 
pellet was resuspended in another 3 00 ml of lysis 
buffer and the extraction process was repeated. The 
supernatant from the second procedure was labelled 
"second extract". The second pellet was also frozen 
for possible future extractions. 

Characterization of the soluble protein 
extracts. BCA assays were also done to quantitate 
the soluble protein, using a ubiquitin standard 
(Sigma) to generate the standard curve. A total of 
14 grams of protein was recovered in the first 
extract and 3 grams of protein was recovered in the 
second extract. Spectral analysis indicated that 
there was a significant amount of DNA contamination, 
approximately 110 mg/ml in each sample. Fig. 9 
shows an SDS-PAGE gel run on the first and second 
extracts. Both extracts contain virtually pure 
ubiquitin fusion protein with very little 
contaminating host proteins. This result is 
consistent with protein recoveries in previous 
fermentations in which the ubiquitin fusion is by 
far the predominant soluble protein at this stage of 
the fermentation. Nearly 17 grams of ubiquitin 
fusion protein were recovered from 175 grams of wet 
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cell paste, representing almost 10% of the wet cell 
weight. 

Pig. 10 shows HPLC traces for the D0085 
"first extract" plus a ubiquitin standard (panel A) 
and a test reaction of D0085 "first extract" with 
the cleavage enzyme, UCH-L3 (panel B) . Not only is 
the purity of the simple cell lysate good, in terms 
of contaminating proteins, but the crude lysate is a 
good substrate for UCH-L3 . The reaction goes to 
completion and the peptide is not degraded. 

EXAMPLE 7 

Bulk UCH-L3 reaction. 500 ml of the D0085 
"first extract" was brought up to 800 ml with UCH-L3 
reaction buffer (50 mM Tris, 10 mM DTT, 1 mM EDTA, 
pH 8.5) . Approximately 2 mg of UCH-L3 (1 ml of 
bacterial lysate containing UCH-L3 in catalytic 
amounts) was added and the reaction incubated for 8 
hours at room temperature. The reaction was checked 
for completion by SDS-PAGE, and this showed a small 
quantity of uncleaved fusion remaining. Another 3 
mg of UCH-L3 was added and the reaction was 
incubated at 3 0°C for another 4 hours. The reaction 
was stored at 4°C between incubations where the 
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enzyme continues to cleave peptide product from the 
ubiquitin fusion partner. 

Scale purification of peptide. One third 
of the cleavage reaction (about 250 ml)- was loaded 
onto a 15 x 7.5 cm column packed with Vydac C18 
resin (particle size of 15 microns) and run on a 
Biotage "kiloprep" instrument. A TFAracetonitrile 
gradient was then run. Peptide was recovered in 1 
liter of solvent and analyzed by analytical HPLC. 
Pig. 11 shows the chromatogram of the peptide. The 
degree of purity is excellent and nearly 100% of the 
peptide product is recovered. 



EXAMPLE 8 

Four different ubiquitin peptide fusion 
15 clones, ranging in size from 8 to 70 amino acids, 
were grown in 10 liter fermentation batches. 

Growth conditions. "Super terrific" 
broth (Sambrook, 1989/ UMD modification) (ref 26) 
was utilized in all fermentations. 500 ml starter 
cultures in 1 L shake flasks were grown from single 
colonies by overnight incubation (16-20 hours) in LB 
media (prepared according to Rodriguez and Tait 
(1983)) with ampicillin (100 /zg/ml) , neomycin (10 
Mg/ml) and 1% glucose. Shake flask cultures were 
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maintained at 30°C and 200 rpm in a New Brunswick 
reciprocating environmental shaker. Bioreactors 
were monitored and controlled to pH 7,0. Dissolved 
oxygen was maintained above 30%- of air saturation. 
The inoculum used for all experiments was 
approximately 5% v/v. Cells were induced by IPTG 
addition to 1 mM at OD $00 -5.0, 

Enzymatic reaction. Five 'hundred ml of 
the D85 lysate, containing approximately 14 grams of 
ubiquitin fusion protein, was brought up to 80 ml in 
UBP reaction buffer {50 mM Tris, 10 mM DTT, l mM 
EDTA, pH 8.5) . Approximately 2 mg of UBP enzyme 
(UCH-L3) was added and the reaction incubated for 8 
hours at room temperature. The reaction was stored 
at 4°C until preparative RP-HPLC could be done. 

Recovery of purified peptide. Preparative 
RP HPLC was done on a Biotage "kiloprep" instrument. 
250 ml of the cleavage reaction was loaded and a 15- 
40% A:B gradient was run over 45 minutes on a 15 x 
7.5 cm column packed with Vydac C18 resin (particle 
size of 15 microns) . Peptide was recovered in 640 
ml of solvent and analyzed by analytical HPLC. 

Quantitation of purified peptide. Several 
one ml aliquots of purified peptide collected from 
HPLC were lyophilized and the TFA salts were weighed 
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on a Mettler balance. The average weight of each 
aliquot was 1.7 mg, corresponding to a yield of 
1.088 grams from' approximately one third of the 
cleavage reaction (or the digestion of about 4.38 
grams of the ubiquitin fusion) . Thus, the purified 
peptide was recovered in nearly quantitative yield, 
FAB Mass spec analysis of the lyophilized 
peptide. The molecular weight of the lyophilized 
peptide was verified using FAB mass spec analysis. 
Growth curves for these fermentations are shown in 
figure 8 . The recombinant fusion proteins were 
expressed and cell pastes harvested. Induction of 
fusion protein synthesis was done at relatively low 
cell densities (O.D. 600»5 . 0) . 

Wet cell paste was processed as described 
and bacterial lysates generated by repeated freeze - 
thaw cycles. These were analyzed by SDS-PAGE and 
the lysates were found to be very concentrated. 
Figure 9 shows an SDS-PAGE gel where each sample 
lane contains 3 fil of lysate diluted 1:10 with 
water. These lysates contain at least 95% ubiquitin 
fusion protein by densitometry. The D106 yield is 
low due to toxicity of the ubiquitin fusion to host 
strain, resulting in low biomass recovered. The 
major contaminant appears to be DNA, measured at 
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100-115 Aig/ml. Estimated yields of ubiquitin fusion 
protein for each fermentation run are shown in Table 
3 . 
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TABLE 3 
YIELDS OF FUSION PROTEINS 


Run # 


Length of 
Peptide 


Grams of 
cell paste 


Grams of 
fusion protein 


D85 


17 


175 


17-52 


D86 


33 


135 


29-58 


D95 


8 


197 


60-120 


D96 


70 


247 


43-86 


D106 


53 


35 


1.5 



10 Conclusions 

1. Ubiquitin-peptide fusions, ranging in size 
from 8 to 70 amino acids are expressed at very high 
levels in E . coli. in this system, more than 10% of 
the wet weight of the cell paste is fusion protein. 

15 These fusion proteins are soluble and are recovered 
easily in simple freeze-thaw lysates. This is the 
highest reported recovery of a recombinant protein 
from E. coli . 

2. The crude extract generated by fermentation 
20 of ubiquitin-peptide fusions is extremely clean, and 

requires little, if any, yield-cutting downstream 
processing, prior to the enzymatic release of the 
peptide extension. 

3. A simple three-step process has been 
25 demonstrated with near quantitative yields of 

purified peptide product from bacterial cell pastes: 



WO 97/01627 



PCT/US96/10858 



39 

1) freeze- thaw lysis, 2) enzymatic release of 
peptide from ubiquitin, and 3) purification of the 
peptide by RP-HPLC. 

The sequences of the peptides of Table 3 are shown 
5 in Table 4 . 
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TABIiZ 4 


Size of 
Extension 


Level of 
Overexpression 




8 amino acids 


-80% of total protein 


KG DEES LA (SEQ ID NO: 2) 


17 amino acids 


-78% of total protein 


NCWGYIGERCQYRDLK 
(SEQ ID NO;3> 


17 amino acids 


-73% of total protein 


NCWGYI GERCQYRDLA 
{SEQ ID N0:4) 


22 amino acids 


-70% of total protein 


NCVVGYIGERCQYRDLKWWELR 
(SEQ ID NO: 5) 


22 amino acids 


-88% of total protein 


NCWGYIGERCQ YRDL KWWE LA. 
(SEQ ID NO:G> 


23 amino acids 


-3 8% of total protein 


GIG KFLH S AKKFGKAFVG I EMNS 
(SEQ ID NO: 7) 


28 amino acida 


-70% of total protein 


S LRRS S C FGGRMDRI GAQSGLGCNS FRY (SEQ ID 
NO: 8) 


29 amino acids 


-72% of total protein 


KS QGT FT SD YS KYLD SRRAQD FVQWLMNT (SEQ ID 
NO:9) 


33 amino acids 


-75% of total protein 


CGNLSTCMLGTYTQDFNKFHTFPQTAIGVGAPA (SEO ID 
NOilO) 


53 amino acids 


-50% of total protein 


NSDSECPLSHDGYCLHDGVCMYIEAtiDLYACNCWGYIGER 

CQYRDLKWWELR 

(SEQ ID NO: 11) 


70 amino acids 


-70% of total protein 


EICPSFQRVIETLLMDTPSSYEAAMELFSPDQDMREAGAOL 
KKLVDTLPQKPRES I IKLMEKIAQSSLCN 
(SEQ ID NO: 12) 



The lysates also were evaluated by reverse 
phase HPLC. All lysates are efficiently digested to 
ubiquitin plus product. Figure 10 shows the results 
obtained for the D85 lysate. The enzymatic cleavage 
reaction is measured by RP-HPLC, shown in panel c. 
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Analytical' reactions typically go to completion at 
room temperature in 20 minutes. The bulk cleavage 
of D85 was done over a period of several hours at a 
weight ratio of 1:7000, enzyme : substrate. Crude 
5 lysates appeared to be good enzymatic substrates, as 
evidenced by cleavage reactions in Pig. 9 (lanes 3, 
5, 7, 9, li) . a major advantage is that the 
reaction goes to completion and the peptide is not 
degraded. The reaction in lane 11 goes to 
10 completion with an extended incubation time of 120 
minutes. This reaction is slower due to the large 
size of the peptide extension of this fusion (70 
amino acids) . 

The peptide product of D85 was then 
15 purified by preparative RP-HPLC. Figure 10 shows 

the RP-HPLC analysis of the purified peptide. It is 
>95% pure and was recovered in high yield, 
corresponding to 302 mg/liter of bacterial culture. 

FAB mass spec was used to determine the 

2 0 molecular weight of the purified peptide (Figure 

11) . The product corresponds to the predicted 
molecular weight of the TFA salt of the peptide. 
FAB mass spec has been used to verify that the 
molecular weights of the BBS and D95 peptides are 
25 correct, as well, 

Analytical work shows that the correct 
product is obtained with this process. A 
preparative purification of one product demonstrates 
that the method produces an exceptionally high 

3 0 product yield. This work demonstrates the value of 

ubiquitin fusions as a peptide production method 
with potential for obtaining much higher quantities 
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of peptide from 10 liter fermentations than has 
heretofor been possible. 

DISCUSSION 

Ubiquitin fusion technology is a powerful 
bioprocess for producing proteins and peptides in 
the bacterium Escherichia coli. The two components 
of the technology are 1) a ubiquitin-specif ic prote- 
ase, which cleaves only C-terminal extensions from 
ubiquitin, and 2) an expression system consisting of 
a suitable E. coli host strain paired with a 
plasmid that encodes the ubiquitin fusion. 
Components have been put together to produce several 
peptides, ranging in size from 8 to 70 amino acids. 
A combination of the exceptional over-expression of 
the ubiquitin-peptide fusion protein in the bacteria 
and the simple, three step purification, including 
the enzymatic cleavage reaction, insures cost- 
effective production of gram quantities of peptides. 
This technology has the capacity to generate over 2 
grams of pure peptide from 12 grams of ubiquitin 
fusion, using the cell paste generated from a single 
10 -liter fermentation. 

The value of a bioprocess lies in its 
efficiency. In order to evaluate overall efficiency 
and make general comparisons to other bioprocesses, 
two numbers that effect overall yield are required. 
These are: 1) the yield of raw material that can be 
generated by manipulation of growth of the organism 
and expression of the protein product ("upstream 
processing"), and 2) the percent losses incurred in 
the purification of the final product {"downstream 
processing" ) . 



Multigram yields for upstream processing 
using ubiquitin fusion technology have been shown 
for a significant size range of C-terminal 
extensions of ubiquitin. The outstanding yields 

for these fusions illustrate that this process is 
quite generic and could be used for the production 
of peptides up to at least 70 amino acids. These 
are the highest reported recoveries of recombinant 
proteins from E. coli. 

The UCH-L3 ubiquitin-specif ic protease is 
capable of cleaving very large amounts of fusion 
protein at very high substrate to en2yme ratios 
(7000:1), under ordinary benchtop conditions. See 
Recksteiner, M, (ed.), Ubiquitin, Plenum Press, N.Y. 
(1988) . 

The crude extract generated by 
fermentation of the model ubiquitin fusion is 
extremely clean, and requires little, if any, yield- 
cutting downstream processing. The percent losses 
of sample iri downstream processing are relatively 
small and nearly 100% of product was recovered .as 
lyophilized peptide from the Biotage column- 
There are several unique features of 
ubiquitin fusion expression in fermentor cultures. 
Fermentations with ubiquitin expressing bacteria 
show an unusual growth profile. The first set of 
fermentations <D68 and D69) showed that once the 
level of ubiquitin fusion reaches an overwhelming 
majority of the total bacterial cellular protein, 
the cells die, and specific yield decreases 
dramatically in less than 30 minutes. The cells may 
be lysing from protein overload, or from a lack of 
sufficient host protein content to mediate 
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successful cell division. Fermentation parameters, 
which are monitored routinely, such as cell density, 
dissolved oxygen, and pH do not predict these 
losses. Therefore timing of the harvest is 
5 critical, and must be done no more than 2.5 hours 
after induction with 1 mM IPTG. 

The D68 and D69 data also show that, 
contrary to the common experience of losing specific 
yield productivity in fermentbr cultures, this 

10 system was equal or superior to the shaker flask 

results. Densitometry of Coomassie stained SDS-PAGE 
gels indicated a strong trend towards a vast 
majority of ubiquitin fusion over host protein 
content. The results show that the ubiquitin fusion 

15 was both soluble and recoverable, and the quantities 
recovered are consistent with initial estimates 
obtained by densitometry of SDS-PAGE on whole cells. 

Another unique feature of ubiquitin fusion 
expression is the fact that a heat shock stress 

2 0 actually augments productivity in this system. This 
phenomenon was initially observed in shaker flask 
cultures which were grown at 3 0°C prior to induction 
in order to maintain vector stability, then shifted 
to 37°C upon induction for maximal expression. 

25 Although the yield in biomass is significantly 

reduced by the severe heat shock (D74, from 30 °C to 
42°C) / the increase in specific yield of the 
ubiquitin fusion more than compensates for the loss, 
for an overall yield of 709 mg/liter, comprising 16% 

30 of the wet weight of the cell paste. It is 

surprising and unexpected that this amount of a 
recombinant protein can be produced intracellularly 
and remain soluble, especially when temperature 
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elevations during expression exacerbates the 
tendency to form inclusion bodies for many proteins 
(20A) . However, ubiquitin is an eukaryotic heat 
shock protein (l) and remains soluble for an 85 °C 
heat treatment during purification, as previously 
reported (7) . 

The fermentation protocol also effects the 
downstream purification of the ubiquitin fusion. 
The heat shock in vivo decreased the effectiveness 
of the in vitro heat treatment, with respect to 
specific yield of the ubiquitin fusion in unheated 
versus heated lysates. The heat treatment did not • 
appear to decrease the amount of the ubiquitin 
fusion in these samples, rather the in vivo heat 
15 shock appears to have increased the yield of host 
proteins in the heat treated samples versus the 
unheated samples. It appears that the physiological 
basis for this qualitative difference in host 
proteins in cells subjected to heat shock is that 
20 the in vivo heat shock has altered the host 

proteins, rendering them more soluble in a warmer 
environment, and thereby ensuring survival of the 
bacteria. Finally, the results demonstrate that the 
protein content of the induced recombinant cultures 
25 is significantly higher than that of the parent 

strain. In the induced A3 7 cultures, the protein 
content stays roughly constant as the cell density 
rises, while in the parent strain, the protein 
percentage of each cell decreases as the cell 
3 0 density increases. This decrease in protein content 
may be due in part to the severe stresses of heat 
shock and/or the chemical induction with IPTG. The 
decrease in protein content supports the view that 
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the metabolic efficiency of the cells decreases with 
increasing cell density (12, 23, 24), and helps to 
explain the lower specific yields obtained for high 
density fermentor cultures versus those obtained 
5 with shaker flask cultures where densities are much 
lower . 

In contrast, the protein content of 
ubiquitin fusion strain of the invention is not 
decreased by the induction or the heat shock. The 

10 increase in protein content is due mostly to the 

accumulation of the ubiquitin fusion, although the 
host proteins also appear to be present in higher 
amounts in this strain, relative to the parent 
strain, based on Coomassie visualization in the 

15 gels. 

3 0 peptides ranging in size from 8 to 70 amino 
acids, have been produced as ubiquitin fusions. 
Peptides of practical interest (mostly therapeutic 
applications) shown in Table 4. Theoretically, 

20 there is no size limit to the expression system. 

Compounds that could be produced using the invention 
include, inter alia, peptide hormones, antimicrobial 
peptides, peptide epitope fragments, ACTH, VIP, 
GHRH, CGRP, amylin, osteocalcin, insulin, 

25 natriuretic peptides. 
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WHAT 15 CLAIMED IS ; 

1 . A host cell comprising an expression vector 
encoding a ubiquitin-peptide fusion protein, said 
host cell being capable of expressing said protein 
in an amount of at least approximately 1 gram per 
liter of culture medium. 

2. The host cell of claim 1 which is a bacteria. 

3. The host cell of claim 2 which is E, coli. 

4. The host cell of claim 3 which is A37. 

5. The host cell of claim 1 which is a eukaryotic 
cell'. 

6. The host cell of claim 5 which is a yeast. 

7. The host cell of claim 5 which is a insect 
cell. 



!5 8. A transformed strain of E. coli that is capable 
of expressing a recombinant protein in an amount of 
at least approximately 1 gram per liter of culture 
medium. 

9. The transformed strain of claim 8 in which said 
20 recombinant protein is an ubiquitin fusion protein. 

10. A fermentation method for producing a target 
peptide from an ubiquitin fusion protein comprising: 
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a) inducing the synthesis of the fusion protein 
by a transformant contained in a fermentation 
culture; 

b) culturing the transformant under conditions 
suitable for intracellular production and 
accumulation of the induced fusion protein; 

c) recovering said fusion protein in at least 
approximately 50% specific yield; 

d) hydrolysing said induced fusion protein 
using an ubiquitin specific hydrolase to release the 
target peptide, and 

e) recovering said target peptide.. 



11. The method for producing a target peptide from 
an ubiquitin fusion protein according to claim 10 
wherein the culturing conditions selected result in 
the intracellular production and accumulation of the 
induced fusion protein in at least 50% specific 
yield. 



12. The method for producing a target peptide from 
20 an ubiquitin fusion protein according to claim 10 

wherein the culturing conditions selected result in 
the intracellular production and accumulation of the 
induced fusion protein in at least approximately one 
gram of said fusion protein per liter of said 
25 culture. 

13 . The method of claim 10 wherein the transformant 
is selected from prokaryotic or eukaryotic cells. 

14 . The method of claim 13 wherein the prokaryotic 
cell is a bacteria. 
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15. The method of claim 14 wherein the bacteria is 
E . coli. 

16. The method of claim 15 wherein the E. coli is 
A37. 

5 17. The method of claim 13 wherein the eukaryotic 
cell is plant, mammalian, yeast or insect cell. 

18. The method of claim 10 wherein synthesis is 
induced by means of a chemical agent . 

19 . The method of claim 18 wherein said chemical 
10 agent is IPTG, lactose, tryptophan, sugars and 

analogs, maltose, arabinose, nalidexic acid, 
oxalinic acid or nitrogen. 

20. The method of claim 18 wherein said chemical 
agent is IPTG or lactose. 

15 21. The method of claim 18 wherein said chemical 
agent is appropriate for any one of the following 
promoters T7, lac, lacUVS, tac, trc, trp, lexA, 
malE, araC, XpPL, and nifA. 

22. The method of claim 10, further comprising 
2 0 purifying said fusion protein using a heat step. 

23. The method of claim 22, wherein said heat step 
is carried out at about 85°C. 
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24. The method of claim 10, further comprising 
application of a physiological stress agent at the 
time of inducing synthesis. 

25. The method of claim 24, wherein the 
physiological stress agent is heat shock. 

26. The method of claim 25, wherein said heat shock 
is a temperature increase of about 12° C. 

27. The method of claim 18, further comprising 
application of a physiological stress agent at the 
time of inducing the synthesis of the fusion 
protein. 

28. The method of claim 10, wherein said target 
peptide has between 8 and 70 amino acids. 

29. The method of claim 10, wherein said target 
peptide is selected from the group peptide hormones, 
antimicrobial peptides, peptide epitope fragments, 
ACTH, VIP, GHRH, CGRP, amylin, osteocalcin, insulin, 
and natriuretic peptides. 

30. The method of claim 10, wherein said target 
peptide is selected from peptides those having 
between 2 amino acids and 6000 amino acids. 

31. The method of claim 10, wherein said target 
peptide is selected from peptides those having 
between 6 amino acids and 500 amino acids. 
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32. The method of claim 10, wherein said target 
peptide has between 8 and 70 amino acids. 

33. The method of claim 10, wherein said target 
peptide is selected from peptides those having 

5 KGDEESLA, NCWGYIGEROQYRDLK , NCWGIYGERCQYRDLA, 
NCWGYIGERCQYRDLKWWELR , NCWGYIGERCQYRDLKWWELA, 
GIGKFLHSAKKFGKAFVGEMNS , 
SLRRSSCFGGRMDRIGAQSGLGCNSFRY, 
HSQGTFTSDYSKYLDSRRAQD'FVQWLMNT , 
1 0 CHNLSTCMLGTYTQDFNKFHTFPQTAIGVGAPA, 

NSDSECPLSHDGYCLHDGVCMYIEALDLYACNCWGYIGERCQYRDLKWWEL 
R, and 

EICPSFQRVIETLLMDTFSSYEAASVIELFSPDQDMREAGAQLKKLVDTLPQ 
KPRESI IKLMEIKIAQSSLCN . 

15 34. A kit for the scale up of a fermentation 

process for producing recombinant fusion ubiquitin- 
target peptide fusion proteins comprising an 
expression vector, a nucleotide sequence encoding an 
ubiquitin and a host cell capable of expressing said 

2 0 fusion protein in an amount of at least 

approximately 1 gram per liter of culture medium. 

35. The kit of claim 33 wherein the host cell is a 
bacteria. 

36. The kit of claim 34 wherein the bacteria is E. 
25 . coli. 

37. The kit of claim 35 wherein the E. coli is A37. 

38. The kit of claim 33 wherein the host cell is 
eukaryotic cell. 
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39. The kit of claim 37 wherein the eukaryotic cell 
is a plant, mammalian, yeast or insect cell. 

40. .The method of claim 10, wherein said target 
peptide is selected from peptides those having 

5 KGDEESLA (SEQ ID NO: 2), NCWGYIGERCQYRDLK (SEQ ID 

NO: 3)., NCWGIYGERCQYRDLA (SEQ ID NO: 4), 

NCWGYIGERCQYRDLKWWELR (SEQ ID NO: 5), 

NCWGYIGERCQYRDLKWWELA (SEQ ID NO: 6), 

G I GKFLHSAKKFGKAF VGI EMNS (SEQ ID NO: 7), 

10 S LRRS S CFGGRMDRI GAQSGLGCN S PRY (SEQ ID NO: 8), 

HSQGTFTSDYSKYLDSRRAQDFVQWLMNT (SEQ ID NO: 9), 

CGNLSTCMLGTYTQDFNKFHTFPQTAIGVGAPA (SEQ ID NO: 10), 

NSDSECPLSHDGYCLHDGVCMYIEALDLYACNCWGYIGERCQYRDLKWWEL 
R (SEQ ID NO: 11) , and 

15 EICPSFQRVIETLLMDTPSSYEAAMELFSPDQDMREAGAQLKKLVDTLPQKP 
RESIIKLMEKIAQSSLCN (SEQ ID NO: 12) . 
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